Model Selection

4-bit Quantization

# 4-bit Quantization

Josiefied Qwen3 30B A3B Abliterated V2 4bit

This is a 4-bit quantized version converted from the Qwen3-30B model, suitable for text generation tasks on the MLX framework.

Large Language Model

Medgemma 27b Text It 4bit

MedGemma-27B-Text-IT-4bit is an MLX-format model converted from Google's MedGemma-27B-Text-IT model, specifically optimized for medical and clinical reasoning tasks.

Large Language Model

Moondream 2b 2025 04 14 4bit

Moondream is a lightweight vision-language model designed for efficient cross-platform deployment. The 4-bit quantized version released on April 14, 2025 significantly reduces memory usage while maintaining high accuracy.

A Turkish law-specific language model based on Qwen3-14B, fine-tuned using the LoRA method and employing 4-bit quantization technology

Large Language Model Supports Multiple Languages

QWEN 3B INSTRUC Medical COT SFT 2kstep 4kcol

A 3B parameter instruction fine-tuned model based on Qwen2.5 architecture, optimized for training speed using Unsloth and Huggingface TRL library

Large Language Model

Transformers English

hailong18102002

Qwen3 4B Rpg Roleplay

A roleplay dialogue model fine-tuned based on Qwen3-4B, excelling in generating coherent dialogues that align with character traits

Large Language Model English

Mistral 7B Instruct V0.3 Forensics V1

This model is a fine-tuned version optimized from Mistral-7B-Instruct-v0.3, specifically designed for Q&A tasks in the field of forensic investigations, supporting advanced forensic reasoning and rapid knowledge retrieval.

Large Language Model

UI TARS 1.5 7B 4bit

UI-TARS-1.5-7B-4bit is a multimodal model focused on image-text-to-text conversion tasks, supporting the English language.

Transformers Supports Multiple Languages

VL Rethinker 72B 4bit

VL-Rethinker-72B-4bit is a multimodal model based on Qwen2.5-VL-7B-Instruct, supporting visual question answering tasks, and has been converted to MLX format for efficient operation on Apple devices.

Transformers English

Gemma 3 27b It Qat 4bit

Gemma 3 27B IT QAT 4bit is an MLX-format model converted from Google's original model, supporting image-to-text tasks.

Transformers Other

Space Voice Label Detect Beta

Fine-tuned version based on Qwen2.5-VL-3B model, trained using Unsloth and Huggingface TRL library, achieving 2x inference speed improvement

Transformers English

Qwen2.5 Omni 7B GPTQ 4bit

A 4-bit GPTQ quantized version of the Qwen2.5-Omni-7B model, supporting multilingual and multimodal tasks.

Multimodal Fusion

Safetensors Supports Multiple Languages

Gemma 3 27b It Abliterated Mlx 4Bit

This is a 4-bit quantized version converted from the mlabonne/gemma-3-27b-it-abliterated model, optimized for the MLX framework.

Large Language Model

Llama model trained with Unsloth and Huggingface TRL library, achieving 2x inference speed improvement

Large Language Model

Transformers English

Llama 3.2 11B Vision Radiology Mini

Vision instruction fine-tuned model optimized with Unsloth, supporting multimodal task processing

Transformers English

Sales Conversations Unsloth Llama 3.1 8B Instruct

4-bit quantized version based on Meta-Llama-3.1-8B-Instruct, efficiently trained using Unsloth and TRL libraries

Large Language Model

Transformers English

Qwen2 Audio 7B Instruct 4bit

This is the 4-bit quantized version of Qwen2-Audio-7B-Instruct, developed based on Alibaba Cloud's original Qwen model. It is an audio-text multimodal large language model.

Llama3.1 8b Instruct Summarize Q4 K M

A 4-bit quantized version based on Meta-Llama-3.1-8B-Instruct, trained using Unsloth and Huggingface TRL libraries, achieving 2x speed improvement.

Large Language Model English

Qwen2 1.5B Summarize

A specialized summarization model fine-tuned for 2 rounds based on Qwen2-1.5B-Instruct

Text Generation

Transformers English

thepowerfuldeez

Omost Dolphin 2.9 Llama3 8b 4bits

Omost's instruction fine-tuned model based on Llama3-8B, pre-trained with the Dolphin-2.9 dataset and quantized in 4-bit NF4 format.

Large Language Model

Llama3 8B Medical

A 4-bit quantized version of the LLAMA-3-8B model fine-tuned for medical Q&A

Large Language Model

Transformers English

Llama3 Toxic 8B Float16

A text generation model fine-tuned based on unsloth/llama-3-8b-bnb-4bit, trained using Unsloth and TRL libraries with 2x speed improvement

Large Language Model

Transformers English

Llama 3 70B Uncensored

This is a text generation model fine-tuned using Unsloth and TRL libraries on the Llama-3-70B model, achieving 2x faster training speed.

Large Language Model

Transformers English

Cogvlm Grounding Generalist Hf Quant4

CogVLM is a powerful open-source vision-language model supporting tasks like object detection and visual question answering, featuring 4-bit precision quantization.

Tinyllama NSFW Chatbot

A fine-tuned language model based on the 4-bit quantized version of TinyLLaMA, efficiently trained using Unsloth and TRL libraries

Large Language Model

Transformers English

Internlm Xcomposer2 7b 4bit

InternLM-XComposer2 is a vision-language large model (VLLM) based on InternLM2, featuring advanced image-text understanding and creation capabilities.

Internlm Xcomposer2 Vl 7b 4bit

A vision-language large model based on InternLM2, with outstanding image-text understanding and creation capabilities

Meditron 7B AWQ

Meditron 7B is a large language model in the medical field developed by the EPFL LLM Team. It is further pre-trained based on Llama-2-7B and focuses on medical knowledge encoding and clinical decision support.

Large Language Model

Transformers English

Llama 2 7b Mt French To English

A LoRA adapter fine-tuned on the Meta Llama 2 7B model, specifically designed for French-to-English text translation tasks.

Machine Translation Supports Multiple Languages

Pygmalion 6b 4bit 128g

A 4-bit GPTQ quantized model based on Pygmalion-6B, suitable for dialogue generation tasks, supporting English text generation

Large Language Model

Transformers English

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase